Data selection and calibration issues in automatic language recognition - investigation with BUT-AGNITIO NIST LRE 2009 system
نویسندگان
چکیده
This paper summarizes the BUT-AGNITIO system for NIST Language Recognition Evaluation 2009. The post-evaluation analysis aimed mainly at improving the quality of the data (fixing language label problems and detecting overlapping speakers in the training and development sets) and investigation of different compositions of the development set. The paper further investigates into JFA-based acoustic system and reports results for new SVM-PCA systems going beyond BUT-Agnitio original NIST LRE 2009 submission. All results are presented on evaluation data from NIST LRE 2009 task.
منابع مشابه
The L2F Language Verification System for NIST LRE 2009
This paper presents a description of the INESC-ID’s Spoken Language Systems Laboratory (LF) Language Verification system submitted to the 2009 NIST Language Recognition evaluation. The LF system is composed by the fusion of eight individual sub-systems: four phonotactic systems and four acoustic based methods. Language recognition results have been submitted for the “closed-set”, “open-set” and...
متن کاملBUT language recognition system for NIST 2007 evaluations
This paper describes Brno University of Technology (BUT) system for 2007 NIST Language recognition (LRE) evaluation. The system is a fusion of 4 acoustic and 9 phonotactic subsystems. We have investigated several new topics such as discriminatively trained language models in phonotactic systems, and eigen-channel adaptation in model and feature domain in acoustic systems. We also point out the ...
متن کاملMultilevel and channel-compensated language recognition: ATVS-UAM systems at NIST LRE 2009
This paper presents the systems submitted by ATVS – Biometric Recognition Group at 2009 language recognition evaluation, organized by the National Institute of Standards and Technology of United States (NIST LRE’09). Apart from the huge size of the databases involved, two main factors turn the evaluation into a very difficult task. First, the number of languages to be recognized was the biggest...
متن کاملA Study of the Influence of Speech Type on Automatic Language Recognition Performance
Automatic language recognition on spontaneous speech has experienced a rapid development in the last few years. This development has been in part due to the competitive technological Language Recognition Evaluations (LRE) organized by the National Institute of Standards and Technology (NIST). Until now, the need to have clearly defined and consistent evaluations has kept some real-life applicat...
متن کاملFusing language information from diverse data sources for phonotactic language recognition
The baseline approach in building phonotactic language recognition systems is to characterize each language by a single phonotactic model generated from all the available languagespecific training data. When several data sources are available for a given target language, system performance can be improved using language source-dependent phonotactic models. In this case, the common practice is t...
متن کامل